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Abstract. The consistency of Frechet medians is proved for probabil- 
ity measures in proper metric spaces. In the context of Riemannian 
manifolds, assuming that the probability measure has more than a half 
mass lying in a convex ball and verifies some concentration conditions, 
the positions of its Frechet medians are estimated. It is also shown 
that, in compact Riemannian manifolds, the Frechet sample medians of 
generic data points are always unique. 



1. Introduction 

The history of medians can be dated back to 1629 when P. Fermat initi- 
ated a challenge (see [9]): given three points in the plan, find a fourth one 
such that the sum of its distances to the three given points is minimum. The 
answer to this question, which was firstly found by E. Torricelli (see [17j ) 
in 1647, is that if each angle of the triangle is smaller than 27r/3, then the 
minimum point is such that the three segments joining it and the vertices of 
the triangle form three angles equal to 2tt/3; and in the opposite case, the 
minimum point is the vertex whose angle is greater than or equal to 2n/3. 
This point is called the median or the Fermat point of the triangle. 

The notion of median also appears in statistics since a long time ago. In 
1774, when P. S. Laplace tried to find an appropriate notion of the middle 
point for a group of observation values, he introduced in [T3] "the middle of 
probability" , the point that minimizes the sum of its absolute differences to 
data points, this is exactly the one dimensional median. 

A sufficiently general notion of median in metric spaces is proposed in 
1948 by M. Frechet in his famous article [TU] , where he defines a p-mean of 
a random variable X to be a point which minimizes the expectation of its 
distance at the power p to X. In practice, two important cases are p = 1 and 
p = 2, which correspond to the notions of Frechet median and Frechet mean, 
respectively. Probably the most significant advantage of the median over the 
mean is that the former is robust but the latter is not, that is to say, the 
median is much less sensitive to outliers than the mean. Roughly speaking 
(see p2]), in order to move the median of a group of data points to arbitrarily 
far, at least a half of data points should be moved. On the contrary, in order 
to move the mean of a group of data points to arbitrarily far, it suffices to 
move one data point. So that medians are in some sense more prudent 
than means, as argued by M. Frechet. The robustness property makes the 
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median an important estimator in situations when there are lots of noise 
and disturbing factors. 

Under the framework of Riemannian manifolds, the existence and unique- 
ness of local medians are proved in |19j for probability measures whose sup- 
port are contained in a convex geodesic ball. It should be noted that, if 
the local curvature conditions in [19] are replaced by global ones, then it 
is shown in pQ that the local medians are in fact global medians, that is, 
Frechet medians. Stochastic algorithms and deterministic algorithms for 
computing medians can be found in [3] and [15] . 

The aim of this paper is to give some basic properties of Frechet medians. 
Firstly, we consider the question of consistency, which is important on the 
estimation of location. Theorem 12.31 states that, in proper metric spaces, if 
the first moment functions of a sequence of probability measures converge 
uniformly to the first moment function of another probability measure, then 
the corresponding sequence of Frechet median sets also converges to the 
Frechet median sets of the limiting measure. As a result, if a probability 
measure has only one Frechet median, then any sequence of empirical Frechet 
medians will converge almost surely to it. In the second section, we study 
the robustness of Frechet medians in Riemannian manifolds. In Euclidean 
spaces, it is shown in [T2] that if a group of data points has more than a 
half concentrated in a bounded region, then its Frechet median cannot be 
drown arbitrarily far when the other points move. A generalization and 
refinement of this result for data points in Riemannian manifolds is given 
in Theorem 13.141 where an upper bound of the furthest distance to which 
Frechet medians can move is given in terms of the upper bound of sectional 
curvatures, the concentration radius and the concentrated mass of the prob- 
ability measure. This theorem also generalizes a result in [1] which states 
that if the probability measure is supported in a strongly convex ball, then 
all its Frechet medians lie in that ball. Moreover, though we have chosen the 
framework of studying the robustness of Frechet medians to be Riemannian 
manifolds, it is easily seen that our results remain true for general CAT(A) 
spaces. Finally, the uniqueness question of Frechet sample medians is con- 
sidered in the context of compact Riemannian manifolds. It is shown that, 
apart from several events of probability zero, the Frechet sample medians 
are unique if the sample vector has a density with respect to the canonical 
Lebesgue measure of the product manifold. In other words, the Frechet 
medians of generic data points are always unique. 

2. CONSISTENCY OF FRECHET MEDIANS IN METRIC SPACES 

Let (M, d) be a proper metric space (recall that a metric space is proper if 
and only if every bounded and closed subset is compact) and P\{M) denote 
the set of all the probability measures pnM verifying 



d(xo,p)(i(dp) < oo, for some xq £ M. 



M 



For every /i G P\(M) we can define a function 



ffj. '■ 
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This function is 1-Lipschitz hence continuous on M. Since M is proper, 
attains its minimum (see [161 P- 42]), so we can give the following definition: 

Definition 2.1. Let /i be a probability measure in Pi(M), then a global 
minimum point of /„ is called a Frechet median of /i. The set of all the 
Frechet medians of [i is denoted by Q^. Let /* denote the global minimum 

"f ./;, 

Observe that is compact, since the triangle inequality implies that 
d(x,y) < 2f* for every x,y G Q^. 

To introduce the next proposition, let us recall that the L 1 -Wasserstein 
distance between two elements /x and v in P\(M) is defined by 



W 1 (n,v)= inf / d(x,y)dir(x,y), 

TrelR^t/) JmxM 

where v) is the set of all the probability measures on M x M with 
margins fj, and u. As a useful case for us, observe that f^(x) = Wi(8 x ,[i) 
for every x G M. The set of all the 1-Lipschitz functions on M is denoted 
by Li Pl (M). 

As is well known that Riemannian barycenters are characterized by convex 
functions (see [12l Lemma 7.2]), the following proposition shows that Frechet 
medians can be characterized by Lipschitz functions. 

Proposition 2.2. Let /i G P\{M) and M be also separable, then 

Q M = jx G M : <p{x) <f* + f^ <p{p)fi(dp), for every tp G Li Pl (M)|. 

Proof. The separability of M ensures that the duality formula of Kantorovich- 
Rubinstein (see [181 P- 107]) can be applied, so that for every x G M, 

* g Q M ^ U(x) < f; 



sup 

<^GLip 1 (M) 



<p(x) - / <p{p)n(dp) 



M 



<S=^ <p(x) < f* + / (p(p)fj,{dp), for every <p G Lip 1 (M), 
Jm 

as desired. □ 

We proceed to show the main result of this section. 

Theorem 2.3. Let (/i n )neN be a sequence in P\{M) and \x be another prob- 
ability measure in P\{M). If (/u n )n converges uniformly on M to f^, then 
for every e > 0, there exists N G N, such that for every n> N we have 

Qfj, n C B(Q^e) :={xeM: d{x,Q^) < e}. 

Proof. We prove this by contradiction. Suppose that the assertion is not 
true, then without loss of generality, we can assume that there exist some 
£ > and a sequence (£ n )neN such that x n G Q^ n and x n ^ B(Q fl , e) for 
every n. If this sequence is bounded, then by choosing a subsequence we can 
assume that (x n ) n converges to a point ^ B(Q fM ,e) because M\B(Q fl ,e) 
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is closed. However, observe that the uniform convergence of (/ /Jn ) n to f a 
implies /* — > /*, hence one gets 

\UM - f;\ < \u(x*) - f^(x*)\ + \um - f^( Xn )\ + \f, n ( Xn ) - /;i 

< sup |/ Mn (x) - U(x)\+d(x*,x n ) + |/* n - f*\ — > 0. 

So that /^(x*) = /*, that is to say x* G Q^. This is impossible, hence 
(x n ) n is not bounded. Now we fix a point x G Q^, always by choosing a 
subsequence we can assume that d{x n ,x) — > +00, then 

ffi( x n) = / d(x n ,p)n(dp) > / (d(x n ,x) - d(x,p))n(dp) 
Jm Jm 

= d(x n , x) - f* — > +00. (1) 
On the other hand, 

lUiXn) ~ f*a\ < \ftt(Xn) ~ fltn(x n )\ + \fiMniXn) ~ f£\ 

< sup \f, n ( x ) - u( x )\ + \f; n - f;\ — > 0. 

This contradicts (P), the proof is complete. □ 

Remark 2.4. A sufficient condition to ensure the uniform convergence of 
(ftin)n on M to ffj, is that PFi(/u n ,/x) — > 0, since 

sup l/^Csc) - fn{x)\ = sup iWi^,^) - Wi{5 x ,n)\ < W x (p, n ,n). 

xeM x&M 

The consistency of Frechet means is proved in [5j Theorem 2.3]. The 
consistency of Frechet medians given below is a corollary to Theorem 12.31 
A similar result can be found in [16, p. 44]. 

Corollary 2.5. Let (X n ) nS N be a sequence of i.i.d random variables of 
law /i G Pi(M) and (m n ) n£ N &e a sequence of random variables such that 
mn G Q Un with [i n = \ Sfe=i ^ffc • A 4 ^ as a unique Frechet median m, 
then m n — > m a.s. 

Proof. By Theorem 12.31 and Remark 12.41 it suffices to show that [i n — ^ fi 
a.s. This is equivalent to show that (see [181 P- 108]) for every / G Cb(M), 

1 n r 
_V/(X fc )— ► / /(p) M (dp) a.s. 

and for every x £ M, 

1 n /" 

-V]ii(a;,X fc ) — >■ / d(x,p)fi(dp) a.s. 

These two assertions are trivial corollaries to the strong law of large numbers, 
hence the result holds. □ 
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3. ROBUSTNESS OF FRECHET MEDIANS IN RIEMANNIAN MANIFOLDS 

Throughout this section, we assume that M is a complete Riemannian 
manifold with dimension no less than 2, whose Riemannian distance is de- 
noted by d. We fix a closed geodesic ball 

B(a, p) = {x G M : d(x, a) < p} 

in M centered at a with a finite radius p > and a probability measure 
p G Pl(M) such that 

p(B(a,p)) = a > -. 

The aim of this section is to estimate the positions of the Frechet medians 
of p, which gives a quantitative estimation for robustness. To this end, the 
following type of functions are of fundamental importance for our methods. 
Let x,z £ M, define 

h X)Z : B(a,p) — > R, pi — > d(x,p) - d(z,p). 

Obviously, h x z is continuous and attains its minimum. 

Our method of estimating the position of Q M is essentially based on the 
following simple observation. 

Proposition 3.1. Let x G B(a,p) c and assume that there exists z G M 
such that 

min h x Jp) > d(x,z), 

p£B(a,p) a 

then x ^ Q^. 

Proof. Clearly one has 

fix(x) - fpiz) = / (d(x,p) - d(z,p))p(dp) + / (d(x,p) - d(z,p))p(dp) 

JB(a,p) JM\B(a,p) 

> a min h x z (p) — (1 — a)d(x, z) > 0. 

peB(a,p) 

The proof is complete. □ 

By choosing the dominating point z = a in Proposition 13.11 we get the 
following basic estimation. 

Theorem 3.2. The set of all the Frechet medians of p verifies 

Proof. Observe that for every p G B(a, p), 

h x ,a{p) = d(x,p) — d(a,p) > d(x,a) — 2d(a,p) > d(x,a) — 2p. 
Hence Proposition 13.11 yields 

1 — a 

Qa fl B(a,p) c C {x G M : min h x a (p) < d(x,a)\ 

peB(a, P ) ' a 

C {x G M : d(x, a) - 2p < - — - d(x, a)} 

a 

= {xeM :d(x,a)<^^}. 
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The proof is complete. □ 

Remark 3.3. It is easily seen that the conclusions of Proposition 13.11 and 
Theorem 13.21 also hold if M is only a proper metric space. 

Remark 3.4. As a direct corollary to Theorem 13.21 if /i is a probability 
measure in P\(M) such that for some point m £ M one has p{m} > 1/2, 
then m is the unique Frechet median of p. 

Thanks to Theorem 13.21 from now on we only have to work in the closed 
geodesic ball 

B. = B(a 2ap 



2a - 1 / 

Thus let A be an upper bound of sectional curvatures in and inj be the 
injectivity radius of B*. Moreover, we shall always assume that the following 
concentration condition is fulfilled throughout the rest part of this section: 



Assumption 3.5. 



2ap . , 7r . 

< r* := min{-p , inj j, 



2a - 1 L 

where if A < 0, then 7r/\/A is interpreted as +oo. 

In view of Proposition 13.11 and Theorem [321 estimating the position of 
can be achieved by estimating the minimum of the functions h x z for some 
x,z G B*. The following lemma enables us to use the comparison argument 
proposed in [1] to compare the configurations in B* with the ones in model 
spaces in order to obtain lower bounds of the functions h xz . 

Lemma 3.6. Let x 6 B* \ B(a,p) and y be the intersection point of the 
boundary of B(a,p) and the minimal geodesic joining x and a. Let z ^ x 
be another point on the minimal geodesic joining x and a. Assume that 
d(a, x) + d(a, z) < r*, then 

argmin h X:Z C{p6 B(a, p) : d(x,p) + d(p, z) + d(z, x) < 2r*}. 

Proof. Let p £ B(a, p) such that d(x,p) + d(p, z) + d(z, x) > 2r*, then 

h x ,z(p) - 2r * ~ rf ( x > z ) ~ 2d ( z 'P) 

> 2(d(a, x) + d(a, z)) - d(x, z) - 2(d(a, z) + p) 

= d(x,y) - d(a,y) +d(a,z). (2) 

If d(a,y) > d(a,z), then ^ yields h XjZ {p) > h XiZ (y), thus p cannot be a 
minimum point of h XjZ . On the other hand, if d(a, y) < d(a, z), then ^fy gives 
that h XjZ (p) > d(x, y)+d(y, z) > d(x, z), which is impossible. Hence in either 
case, every minimum point p of h XjZ must verify d(x,p) + d(p, z) + d(z, x) < 
2rv □ 

As a preparation for the comparison arguments in the following, let us 
recall the definition of model spaces. For a real number k, the model space 
is defined as follows: 

1) if k > 0, then is obtained from the sphere S 2 by multiplying the 
distance function by 1/ yfn; 

2) if k = 0, then M 2 . is the Euclidean space E 2 ; 
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3) if k < 0, then M 2 is obtained from the hyperbolic space H 2 by multiplying 
the distance function by 1/ \J — k. 

Moreover, the distance between two points x and y in M 2 will be denoted 
by d(x,y). 

The following proposition says that for the positions of Frechet medians, 
if comparisons can be done, then the model space M 2 ^ is the worst case. 

Proposition 3.7. Consider in M 2 ^ the same configuration as that in Lemma 
1 5 1 . 61 a closed geodesic ball B(a,p) and a point x such that d(x,a) = d(x,a). 
We denote y the intersection point of the boundary of B(a,p) and the mini- 
mal geodesic joining x and a. Let z be a point in the minimal geodesic joining 
x and a such that d(d,z) = d(a,z). Assume that d(a,x) + d(a,z) < r*, then 

min h XjZ {jp) > min h X;S (p), 

pGB(a,p) p£B(a,p) 

where h x ,z(p) '■= d(x,p) — d(z,p). 

Proof. Let p G argmin/i X)2 . Consider a comparison point p G M 2 ^ such that 
d(z,p) = d(z,p) and Z dzp = Z azp. Then the assumption d(a, x) + d(a, z) < 
r* and the hinge version of Alexandrov-Toponogov comparison theorem (see 
Exercise IX. 1, p. 420]) yield that d(a,p) < d(a,p) = p, i.e. p G B(a,p). 
Now by hinge comparison again and Lemma I3T61 we get d(p,x) < d(p,x), 
which implies that 

h x ,z(p) > hx,z(p) > min hx, z {p)- 

p£B(a,p) 

The proof is complete. □ 

According to Proposition l3.7l it suffices to find the minima of the functions 
h x , z when M equals S 2 , E 2 and H 2 , which are of constant curvatures 1, 
and —1, respectively. 

Proposition 3.8. Let t, u > such that u < p + t < 2ap/ (2a — 1) . 

i)IfM = E> 2 , let x = (sin(p + t), 0, cos(p + i)) and z = (sin u,0, cos u). 
Assume that p + t + u < n, then 

t — p + u, if cot u > 2 cot p — cot(p + t); 



min h x z 



B(a, P ) 



sin psm {p + 1 — i_, 

arccos cos(p + t — u) H — , if not. 

2 sin u sm(p + 1) 



ii) IfM = E 2 , let a = (0, 0), x = {p + t, 0), z = (u, 0), then 



t-p + u, if u< — — ; 

p + 2t 



min h x z 

B(a,p) ' 



P 2 

(p + t-u)Jl — — r, if not. 

V u{p + t) 
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Hi) If M = M 2 , let a = (0,0,1), x = (sinh(p + t), 0, cosh(p + t)) and 
z = (sinh 14, 0, cosh 14), then 

t — p + u, if coth u > 2 coth p — coth(/> + t); 



min h x z 

B(a,p) 



sinh p sinh (p + t — u) 

arccosh cosh(/5 + t — u) — r — , if not. 

2 smh u sinh(p + 1 ) 



We shall only prove the result for the case when M = 8 2 , since the proofs 
for M = E 2 and M = H 2 are similar and easier. The proof consists of some 
lemmas, the first one below says that h x z is smooth at its minimum points 
which can only appear on the boundary of the ball B(a, p). 

Lemma 3.9. Let x' and z' be the antipodes of x and z. Then z' ^ B(a,p) 
and all the local minimum points of h X)Z are contained in dB(a,p) \ {x'}. 

Proof. It is easily seen that d(z', a) = ir—u > p, so that z' £ B(a, p). Observe 
that x' is a global maximum point of h x z which is not locally constant, so 
that x' cannot be a local minimum. Now let p € B(a, p) be a local minimum 
of h X)Z , then h x , z is smooth at p. It follows that grad h x ^ z (p) = 0, which yields 
that h x ^ z {p) = d(x,z), this is a contradiction. The proof is complete. □ 

The following lemma characterizes the global minimum points of h x z . 

Lemma 3.10. The set of global minimum points of h X)Z verifies 

{{y}, if cot u > 2 cot p — cot(p + t); 
sin(/j + t) sinn 
{p G dB(a,p) : -—7 7=^-77 r}, if not, 

sm a{x,p) sm a{z,p) 

where y is the intersection point of the boundary of B(a,p) and the minimal 
iesic joining x and a. 



Proof. Thanks to Lemma 13.91 h suffices to find the global minimum points 
of h x>z for p = (sin p cos 6, sin p sin 6, cos/?) and 6 G [0, 2ir). In this case, 

h x , z (p) =d{x,p) - d(z,p) 

= arccos(sin(p + t) sin p cos 9 + cos(p + t) cos p) 

— arccos(sin u sin p cos 9 + cos u cos p) 
:=h(9). 

Hence let p = (sin p cos 9, sin p sin 9, cos p) be a local minimum point of h x>z , 
then Lemma [3.91 yields that h'(9) exists and equals zero. On the other hand, 
by elementary calculation, 

sin(p + 1) 



h'{9) = sin/? sin # 



sm p sm l 



v/l — (sin(/3 + t) sin p cos 9 + cos(p + t) cos p) 2 
sin 14 

yl — (sin n sin p cos + cos u cos p) 2 
sin(/3 + 1) sinn 
sind(a;,p) sinci(z,p) 
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Hence we have necessarily 

sin(p + t) sinu 



0, 7r or 



sin d(x,p) sin d(z,p) 

Firstly, we observe that w, the corresponding point p when = tt, cannot 
be a minimum point. In fact, let w' be the antipode of w. If d(x, a) < 
d(w',a), then h XjZ (w) = d(x,z). So that w is a maximum point. On the 
other hand, ifd(x,a) > d(w',a), then d(w,x) + d(x, z)+d(z,w) =2tt. Hence 
Lemma 13.61 and the condition p + 1 + u < tt imply that w is not a minimum 
point. So that the assertion holds. 

Now assume that p ^ w,y such that 

sin(p + 1) __ sinu 
sin d(x,p) sin d(z,p) 

Let ft = Z zpa. Then by the spherical law of sines, ([3]) is equivalent to 
sin(ft + Z zpx) = sin f3, i.e. that Z zpx = tt — 2(3. Applying the spherical law 
of sines to Azpx we get 

sin Z x sin 2/3 



sin d(z,p) sin(p + i — it) 
By the spherical law of sines in Aapx, 

sin Z x sin /3 
sinp sin(/9 + t) 

Then Q/© gives that 



(4) 
(5) 



sin d(z, p) cos ft = Sm ^ Sm ^ ^ ^ ^ . (6) 
2sm(p + t) 



By the spherical law of cosines in Aazp, 



„ , _ cosu — cos p cos d(z,p) 

smdlz, p) cos ft = . (7) 

sin p 

Then ® and Q give that 

,, . 2 cos u sin(p + t) — sin 2 psinip + t — u) 

cos(i(z,p) = . . (8) 

2 cos p sm(p + t ) 

Moreover, by ([8]) and spherical law of cosines in Aazp, 

cos8 = ^-^(cotn + cot(p + t)). (9) 

The condition 0<p + t + u<ir and ([9]) give that 

tan p sinfp + t + n) 

cos 9 = yF ; . ; > 0. 10) 

2smnsm(p + t) 

Furthermore, considering p ^ y we must have cos (9 < 1. By ([9]) this is 
equivalent to 

cot u < 2 cot p — cot(p + t), (11) 
which is also equivalent to 

ship ~ u) < sint 
sinu sin(p + 1) 
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For the case when u > p it is easily seen that y is a maximum point of 
h XjZ and hence p must verify ([3]) and the corresponding 6 is determined by 
([9]). Hence in this case, there are exactly two local minimum points of h% % 
and obviously they are also global ones. 

Now let u < p, then easy computation gives 

h"(0) = sinpf - -^—) . (13) 

v ; H \ sint sin(p - u) J K ' 

Hence if S1 "^~j^ > sir ^" M ) , then (fT2|) yields that y is the unique global min- 
imum of h xz . In the opposite case, (|13p implies that y is a local maximum 
point. Hence the same argument as in the case when u > p completes the 
proof of lemma. □ 

We need the following technical lemma. 

Lemma 3.11. // cot u < 2 cot p — cot(p + t), then < d(x,p) — d{z,p) < tt 
for every p £ argmin 

Proof. It suffices to show d(x,p) > d(z,p). For the case when u < p, we 
firstly show that d{z,p) < tt/2. In fact, by ([8]) this is equivalent to show 
that 

(1 + cos 2 p) cosiisin(/) + t) + sin 2 psinucos(/> + t) > (14) 
If p + t < it/2, (|14p is trivially true. Now assume p + t > tt/2. So that 
7r/2 < p + t < tt — u, which implies sin(p + t) > sinn and cos(/? + i) > — cosk. 
Hence we get 

(1 + cos 2 p) cos u sin(p + 1) + sin 2 p sin u cos(p + t) 

> (1 + cos 2 p) cos u sin -u — sin 2 p sin « cos u 

=2 cos /) cos u sin u > 0. 

So that d(z,p) < tt/2 holds. Now if d(x,p) > tt/2, then obviously d(x,p) > 
d(z,p). So that assume d(x,p) < tt/2. Observe that p + t + u < tt implies 
sinn < sin(/9 + t), then ([3]) yields d{x,p) > d(z,p). 

For the case when u > p, it suffices to show that cosd(2;,p) > cosd(x,p) 
for every p = (sin /? cos 9, sin /? sin 0, cos p) with S [0, tt] . Now let 

g{9) = sin /) cos #(sin n — sin(p + t)) + cos /)(cos u — cos(p + i)) 

= cos d{z,p) — cos d{x,p) 

Then g'(6) = — sin p sin #(sinu — sin(/> + 1)). Observe that p-\-t + u < tt and 
u < p + t imply that sinn < sin(p + t), hence g{9) > g(0) = cos d(z,y) — 
cos d{x,y) > 0. The proof is complete. □ 

Proof of Proposition \3.8[ By Lemma 13.101 it suffices to consider the case 
when cot n < 2 cot p — cot(p + t). Let p E argmin h x>z , then by © and the 
spherical law of cosines in Aapx, 

. 2cos(o + 1) sinn + sin 2 psin(p + t — u) . . 

cosa(x,p) = . (15) 

2 cos p sin n 

Now let u = p — v, then p + t — u = i + i>. So that (JSJ and (fTo"j) become 

. 2cos(,9 - v) sin(/> + t) - sin 2 psin(t + v) 

cosa(2,n) = t r . (16) 

v F; 2cospsin(p + t) v ; 
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. 2cos(p + t) sin(p - v) + sin 2 psin(i + v) 

cosd{x,p) = — r . 17 

2 cos psm(p — v) 

It follows that 

cos d(z,p) cos d(x,p) = (4cos(p — v) sin(p + t) cos(p + t) sin(p — v) 

+ 2 sin 2 p sin 2 (f + v) — sin 4 p sin 2 (t + v) ) 
/(4cos 2 psin(p + i)sin(p- v)). (18) 

On the other hand, ([3]) and (fTBj) yield that 

• u \ • u \ r-i 2 u w sm (P + 1) 

sin d[z,p) sm alx,p) = ( 1 — cos d{z,p)) — ^ - 

sm(p — v) 

= (4sin 2 (p + i)(cos 2 p — cos 2 (p — v)) — sin 4 psin 2 (i + u) 
+ 4 sin 2 psin(i + v) sin(p + t) cos(p — f )) 
/(4cos 2 psin(p + i) sin(p — v)). (19) 

Then by {E) and CE} we obtain 

4 cos 2 psin(p + i) sin(p — t> )(cos(d(x,p) — d(z,p)) — cos(t + u)) 
= 4 cos 2 psin(p + i) sin(p — i>)(cos d(x,p) cos d(z,p) + sind(x,p) sind(z,p) — cos(t + v)) 
= 4cos(p — t>) sin(p — v) cos(p + t) sin(p + i) + 2 sin 2 pcos 2 psin 2 (i + v) 

+ 4sin 2 (p + i)(cos 2 p — cos 2 (p — v)) + 4sin 2 psin(p + t) cos(p — v) sin(t + v)) 

— 4 cos 2 p sin(p + i) sin(p — v) cos(i + v) 
= (—4 cos 4 p cos 2 v sin 2 f — 4 cos 4 p sin 2 i sin 2 v + 4 cos 4 p sin 2 i) 

+ (—8 cos p cost cos w ship sin i — 8 cos p cos t sin p sin t sin u + 8cos p cost sin p sin 

+ (— 4 cos 2 p cos 2 £ cos 2 t> sin 2 p — 2 cos 2 p cos 2 i sin 2 p sin 2 v + 4 cos 2 p cos 2 i sin 2 p) 

+ 4 cos p sin p sin t; cos v sin f cos t + 2 cos p cos t> sin p sin t 

ry ry ry ry 

= 2 cos p sin p cos £ sin v + 4 cos p sin p sin t> cos v sin t cos t + 2 cos p cos u sin p sin 
= 2 cos 2 p sin 2 p sin 2 (i + t?) . 

As a result, 

2 cos 2 p sin 2 p sin 2 (t + v) 



cos(d(x,p) — d(z,p)) = cos(t + v) + 
= cos(i + f ) + 



4 cos 2 p sin(p + t) sin(p — v) 

sin 2 p sin 2 (£ + v) 
2 sin(p + t) sin(p — u) 
sin 2 p sin 2 (p + i — u) 



= cos(p + t — u) + 

2smusm(p + 1) 

Now it suffices to use Lemma 13.111 to finish the proof. □ 
We also need the following lemma. 
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Lemma 3.12. Let k be real number and 1/2 < a < 1. For t G (0, p/{2a — 1)] 
de/me 

cot(y / K(2a — l)t) — co^-y/Ki) — 2 ^(-v/k/j), if k > 

^cwW = < (1 - a)p - (2a - l)t, i//e = 0. 

coth(- v /— k(2o — l)t) — coth(i/ = k£) — 2 coth(- v /— ftp), if k < 

Assume that 1/2 < a < 1, f/ien t/iere exists a unique t K G (0, p/(2a — 1)) 
swc/i i/tai 

{* G ( °' 2a^l ] : " °} = (0 ' tK] - 

/n i/tis case, u>/ien k < 0, the function F apK is strictly deceasing. 

Proof. We only prove the case when k = 1, since the proof of the other 
two cases are similar and easier. Observe that F atPt i{0+) = +oo (since 
1/2 < a < 1) and F a ^i(p/(2a - 1)) < (since 2ap/{2a - 1) < tt), then 
there exists some t\ G (0, p/(2a — 1)) such that F ai/ o,i(ii) = 0. Moreover, 



p/ / + x 1 ^ sin((2a - l)t) x2 , n 
F ^ l(t) = ^((2a-l)t) l ( 5Si } " (2a_1) 



Observe that the function Z(i) = sin((2a — l)i)/ sini is strictly increasing on 
(0,vr/(2a)], Z 2 (0+) = (2a-l) 2 < 2a-l and Z 2 (?r/(2a)) = 1 > 2a- 1. Hence 
there exists a unique s G (0,7r/(2a)) such that if t < s, then F' a p l (t) < 0; 
if t = s, then F' apl {t) = 0; if t > s, then F' apl {t) > 0. Hence 
is strictly decreasing on (0, s] and strictly increasing on [s, 7r/(2a)]. Since 
p/(2a — 1) < it /{2a) and F a ^ p ^{p/{2a — 1)) < 0, the point ii must be 
unique. Moreover, it is easily seen that {F a>P! i > 0} = (0,ii]. The proof is 
complete. □ 

The main theorem of this section is justified by the lemma below. 
Lemma 3.13. Assumption \3.5\ implies that 

'sin(VAt), ifA>0; 



< S ^ Where S *® ■= < 
V 2a — 1 l 



t, ^/A = 0; 

^sinh(^/^At), i/A<0. 



Proof. We only prove the case when A > 0, since the proof for the cases 
when A < are easier. Without loss of generality, we can assume that 
A = 1. Since (2a — l)r*/(2a) < tt/2, we have 

2ap 2a — 1 

< r* -<=^ sin p < sin ■ 



2a - 1 r 2a 

So that it is sufficient to show 

a 2a — 1 r* 
: sm — r* < sm — . 



V2a-1 2a 2 
To this end, let c = r*/2 G (0,7r/2], we will show that the function 
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is strictly increasing for a E (1/2, 1). Easy computation gives that 
,,, , n tan(flc) 2-9 

where 9 = (2a — l)/a E (0, 1). Observe that the function x i — > t&nx/x is 
increasing on [0,7r/2), hence it suffices to show that 

tan(6>vr/2) 2 - 9 



(6>vr/2) 1 - 9 
This is true because Becker-Stark inequality (see |.4j) yields 

tan(07r/2) 1 2-9 

< 77T < 



(Ott/2) 1-9 2 1-9' 
The proof is complete. □ 

Now we are ready to give the main result of this section. 

Theorem 3.14. The following estimations hold: 

i) If A > and C B(a, r*/2), then 

„ ^( 1 . / asin(\/Ap). 
Qu C B \ a, — = arcsm f 



V2a - 1 

Moreover, any of the two conditions below implies C B(a,r if /2): 
2ap r* 2ap r* r* 

^ t; b ) ^ — t > tt and F a,pA(^r - p) < °- 



' 2a- 1 _ 2 ' ' 2a - 1 2 v 2 

ii) If A = 0, then 



()„' B(a, (y> 



V ' y/2a - 1 / ' 



mj If A < 0, £/ien 



C -B ( a, — ^= arcsinh f V 



v V2a-1 

Finally, Lemma \3.13\ ensures that any of the above three closed balls is con- 
tained in the open ball B(a,r*/2). 

Proof. Firstly, we consider the case when A > 0. Without loss of generality, 
we can assume that A = 1. For every B(a, p), let t x = d(a, x) — p E 

(0, p/(2a — 1)]. By Propositions 13.11 and 13. 7\ if there exists some z on the 
minimal geodesic joining x and a such that u z = d(a, z) E [0, p + t x ) verifies 
p + t x + u z < r* and mm s ^ ^ h X}S > (1 - a) (p + t x - u z )/a, then x Q^. 
Or equivalently, 

Q^C\B{a,p) c 

C 1 1 £ B, \ B(a, p) : t x E (0, — - — ] has the property that for every u 2 E [0, p + tx) 
[ 2a — 1 

1 - a 1 

such that p + t x + u z < r*, min h xz < (p + i x — u z ) > := A. 

a i 
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Since the restrictive condition of the set A is only on t x , for simplicity and 
without ambiguity, by dropping the subscripts of t x and u z we rewrite A in 
the following form: 



t G (0, — - — ] : for every u G [0, p + t) such that p + t + u < r*, 
2a — 1 

. - 1 — a . . 

mm hx z S (p + £ — u) 

a 

£ € (0, — — p] : for every u G [0, p + such that p + t + it < r* 

- 1 — a 

mm h xz -< (p + t — u) 

a 



U < t G (— — ] : for every «£ [0, p + t) such that p + i + 

[ 2 2a — 1 



it < r*, 



1 — a 

min g < (p + i - u) )■ := BUC. 

a 



Observe that for t G (0, r*/2 — p] and u G [0, p+t), we always have p+i+u < 
r*, hence by Proposition 13.81 and Lemma I3.1U 

\ V — 1 — C£ 

B = < i G (0, — — p] : for every «£ [0, p + t), min h x % < (p + t — u) 

[2 a 

= |t G (0, — p] : for every u G [0, p + t) such that cot n > 2 cot p — cot(p + t), 

1 — a . . 

t — p + u < (p + c — u) 

a 

I 7* 

Pi < t G (0, — — p] : for every n G [0, p + 1) such that cot u < 2 cot p — cot(p + t) 

sin 2 psin 2 (p + t — u) A — a 

cos(p + + —z-. — f — —— > cos ( p + t - 

2 sm u sm{p + t) a 

i G (0, — — p] : for every u G [0, p + i) such that cot n > 2 cot p — cot(p + t), 
u< p - (2a- l)t 

Pi < t G (0, — — p] : for every n G [0, p + 1) such that cot u < 2 cot p — cot(p + t) 

. . sin 2 p sin(p + t — u) sin(p + t — u) 
sm(p + 1) < 



4sinu p + t — u .2a — 1 . 
sin sin ( (p + 1 — u) ) 



2a v 2a 



i G (0, y - p] : cot(p - (2a - < 2 cot p - cot(p + t) 
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^ \ t G (0, — — p] : for every u G [0, p + i) such that cot u < 2 cot p — cot(p + i), 

. sin 2 p sin(p + i — u) sin(p + t — u) 

sm(p + i) < 

4 sin it p + t — u 2a — 1 

sin— ^— sin (-^- (p + t-u)) 

c< ! * G (°j if - p] '■ for ever y u £ (p - (2a - l)t,p + t), 



2 



. . _|_ ^ < srn2 P sin(p + t — u) sin(p + t — u) 

~ 4 sin n p + t — u 2a — 1 . 

sin^— sin(^^(p + t-u)) 

sin 2 p 2a 



f , / a sin p s ] , . 

= Ue(0,j-p]: p + t<arcsin( ^_il )j. (20) 

Hence if Q M C B(a, r*/2), then C = (p and ([20]) says that 

^ i=,f ■ i asinp A 
Qa C B[ a, arcsm : , 

M V V2a-lV 
this completes the proof of the first assertion of i). To show the second one, 
observe that if a) holds, then Theorem 13.21 implies the desired result. Hence 
assume that b) holds. By Proposition 13.81 one has 

Cc<t£ (— — p, ] : for every u G [0, r* — (p + t)) such that 

[ 2 2a — 1 

cot u > 2 cot p — cot (p + t), 

1 — a . . 

c — p + It < ( p + t — u ) 

a 

={t € (y - p, ^-y] : cot(p - (2a - l)t) < 2 cot p - cot(p + t) J. 

Observe that for t G (r*/2 — p, p/(2a — 1)) we have 

cot(p - (2a - < 2 cot p - cot(p + t) 
<J=^ cot(p — (2a — l)t) — cot p < cot p — cot(p + t) 
sin(p — (2a — l)t) ^ sin(p + t) 

sin((2a — l)t) ~ sini 
cot ((2a — l)t) — cot p > cot p + cot i 

^^a,p,l(*)> - 

Hence 

C C |t G - p, : F a , p>1 (t) > o| := D. 

If a = 1, clearly D = (p. Now let 1/2 < a < 1, then Lemma [3.121 yields that 

D = (J±-p, ^_ I ]n(o,ti] = 0. 
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Thus C = 4> still holds, that is, Q p C B(a, r*/2). The proof of i) is complete. 

Now let us turn to the proof of ii) and in). In fact, the proof for these two 
cases are essentially the same as that of i) except to note that we no longer 
need to assume that C B(a, r*/2), because this is implied by Assumption 
13.51 To see this, if 4ap < (2a — l)r*, then it suffices to use Theorem 13.21 
So that let us assume 4ap > (2a — l)r* and show that F aiPi &(r*/2 — p) < 
for A G {—1,0}. This is trivial if A = or a = 1, hence let A = —1 and 
a G (1/2,1). Since r*/2 — p > (1 — a)p/(2a — 1) and F a>Pt _i is strictly 
decreasing, it suffices to show that Fq, jPj _i((1 — a)p/(2a — 1)) < 0. To this 
end, define f(a) = F ajPi _i((l — a)p/(2a — 1)), easy computation gives that 

f (a) = sinh 2 ((l -a)p) ~ (2a-l) 2 swh 2 (±^p) > °' 

because the function x i — > sinhx/x is strictly increasing. Hence f(a) < 
/(l— ) = 2(p~ l — cothp) < 0. The proof is complete. □ 

Remark 3.15. When A > 0, Assumption 13.51 does not imply the condition 
b) in i). In fact, in the case when M = § 2 , we have r* = it and A = 1. 
Then let a = 0.51 and p = 0.99vr(l - (2a)" 1 ), then 2ap/(2a - 1) e (vr/2 + 
1.5393, vr - 0.0314), but F atPtl (n/2 - p) « 0.2907 > 0. 

Remark 3.16. It is easily seen that if we replace r* by any r G (0, r#] in 
Assumption 13.51 then Lemma [3.131 still holds when is replaced by r. This 
observation can be used to reinforce the conclusions of Theorem 13.141 For 
example, in the case when A > 0, 

2«p r* . s / 1 . ,asin(\/Ap)A _ D / rA 
< — implies that Q p C nya, — — arcsin (—j==-) J C Byi, — j 

Remark 3.17. Although we have chosen the framework of this section to 
be a Riemannian manifold, the essential tool that has been used is the hinge 
version of the triangle comparison theorem. Consequently, all the results in 
this section remain true if M is a CAT(A) space (see [6j Chapter 2]) and r* 
is replaced by 7r/vA in Assumption 13.51 

Remark 3.18. For the case when a = 1, Assumption 13.51 becomes 

1 . r i ... 
p < - mm\ —= , mi \. 

2 l v / A 



Observe that in this case, when A > 0, the condition i ? i,p,A( r '*/2 — p) < is 
trivially true in case of need. Hence Theorem 13 . 141 yields that Q p C B(a, p), 
which is exactly what the Theorem 2.1 in [1] says for medians. 

4. UNIQUENESS OF FRECHET SAMPLE MEDIANS IN COMPACT 
RIEMANNIAN MANIFOLDS 

In this section, we shall always assume that M is a complete Riemannian 
manifold of dimension I > 2. The Riemannian metric and the Riemannian 
distance are denoted by ( • , • ) and d, respectively. For each point x G M, 
S x denotes the unit sphere in T X M. Moreover, for a tangent vector v G S x , 
the distance between x and its cut point along the geodesic starting from x 
with velocity v is denoted by t(v). Certainly, if there is no cut point along 
this geodesic, then we define t(v) = +oo. 
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For every point (x\ , . . . , xn) G M , where iV > 3 is a fixed natural 
number, we write 

1 N 

H(xi,...,X N ) = Jr^2 S ^k- 
k=l 

The set of all the Frechet medians of [a(xi, . . . , xat) is denoted by Q(xi, ■ ■ ■ ,xn). 

We begin with the basic observation that if one data point is moved 
towards a median along some minimizing geodesic for a little distance, then 
the median remains unchanged. 

Proposition 4.1. Let (x\, . . . , xn) G M N and m G Q(x\, . . . ,xn). Fix a 
normal geodesic 7 : [0, +00) — > M such that 7(0) = x\, j(d(xi,m)) = m. 
Then for every t G [0, d(xi,m)] we have 



Q(l(t),x 2 , ...,x N ) 



Q(xi,...,XAr)n 7 [t,r(7(0))], i/r(7(0)) < +00; 

Q(xi, . . .,x N ) n 71 [t, +00), ifr(j(0)) = +00. 

Particularly, m G Q(j(t), X2, ■ ■ ■ ,xjy). 

Proof. For simplicity, let \x = (a(xi, . . . , xn) and fit = /J,( r y(t),X2, ■ ■ ■ ,xn). 
Then for every x G M, 

-d(x, Xl ) + -. 



fnt(x) - f m (m) = f^(x) - —d(x,xi) + — d(x,-y(t)) 



U(m) - —d(m,xi) + —d(m,j(t)) 

= (u(x)-U(m)\ + (d(x,i(t))+t-d(x, Xl ) \ >0. 

So that m G Q^ t - Combine this with the fact that m is a median of fj,, it is 
easily seen from the above proof that 

Q^ t = Qf,r){x £ M : d(x, 7(i)) + 1 = d(x, x{)}. 

Now the conclusion follows from the definition of r(7(0)). □ 

The following theorem states that in order to get the uniqueness of Frechet 
medians, it suffices to move two data points towards a common median along 
some minimizing geodesies for a little distance. 

Theorem 4.2. Let {x\, . . . , rcjy) G M N and m G Q(x\, . . . , xn). Fix two 
normal geodesies 71,72 : [0, +00) — > M such that 71 (0) = x\, 7i(d(xi, m)) = 
m ; 72(0) = X2 a^rf J2(d(x2,m)) = m. Assume that 



x 2 $ 



7i[0,r( 7l (0))], i/r( 7 i(0)) < +00; 
^7i[0,+oo), i/r(7i(0)) = +00. 

Then for every t G (0, d{xi,m)] and s G (0, d(x2,m)] we have 
Q(7l(*)i 72(s),a;3, . . . ,x N ) = {m}. 

Proof. Without loss of generality, we may assume that both r(7i(0)) and 
r(72(0)) are finite. Applying Prop osit ion 14 . 1 1 two times we get 

Q(7i0)> 72(s), x 3 , . . . , x N ) C Q(a?i, . . . , x N ) n 71 [i, r( 7 i(0))] n 72 [s, t( 72 (0))]. 
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Since 22 ^ 7i[0> r (7i(0))]> the definition of cut point yields 7i[i, r(7i(0))] D 
72 [s, ^(72(0))] = {m}. The proof is complete. □ 

We need the following necessary conditions of Frechet medians. 

Proposition 4.3. Let (x\,...,xn) € M N and m £ Q(xi, . . . , xn). For 
every k = 1, . . . , N let 7^ : [0, d(m, Xk)] — > M be a normal geodesic such that 
7A;(0) = m and j k (d(rn,x k )) = x k - 
i) If m does not coincide with any x k , then 

N 



£>fe(0) = 0. (21) 



k=l 

In this case, the minimizing geodesies 71, . . . , 7tv are uniquely determined, 
ii) If m coincides with some Xk , then 

£ 7fc(0) < E L ( 22 ) 

Proof. For sufficiently small e > 0, Proposition 14.11 yields that m is a median 
of /x(7i(e),...,7jv(e)). Hence [El Theorem 2.2] gives fl2H and ([22]). Now 
assume that m does not coincide with any x k and, without loss of generality, 
there is another normal geodesic Ci : [0, d(m, x{j\ — > M such that Ci(0) = m 
and Ci(d(m,xi)) = x x . Then §Z\§ yields Ci(0) + E^=2 7fc(°) = °- So that 
Ci(0) = 7i(0), that is to say, Ci = 7i- The proof is complete. □ 

From now on, we will only consider the case when M is a compact Rie- 
mannian manifold. As a result, let the following assumption hold in the rest 
part of this section: 

Assumption 4.4. M is a compact Riemannian manifold with diameter L. 

In what follows, all the measure-theoretic statements should be under- 
stood to be with respect to the canonical Lebesgue measure of the underlying 
manifold. Let Am denote the canonical Lebesgue measure of M. 

The following lemma gives a simple observation on dimension. 

Lemma 4.5. The manifold 

f N 

V = < (x, n\, . . . , njy) : x £ M, (nk)i<k<N C S x , = and there exist 

^ k=l 

k\ 7^ ki such that nk 1 and nk 2 are linearly independent 

is of dimension N(l — 1). 
Proof. Consider the manifold 

W = |(x, m, . . . , n^v) : x £ M, (rik)i<k<N C S x and there exist k\ 7^ &2 

such that nk ± and nk 2 are linearly independent | , 
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then the smooth map 

JV 

/: W — >TM, (x,nt,...,n N ) i — > 



k=l 



has full rank I everywhere on W. Now the constant rank level set theorem 
(see [21 Theorem 8.8]) yields the desired result. □ 

Our method of studying the uniqueness of Frechet medians is based on 
the regularity properties of the following function: 

ip: V x ¥L N — ► M N , 

(x,rii, . . . ,n N ,ri, ...,r N ) i — > (exp x (rim), . . . , exp x (r N n N )). 

For a closed interval [a,b] C R, the restriction of tp to V x [0,6] will be 
denoted by (f a ,b- The canonical projection of V x R^ onto M and R^ will 
be denoted by a and £, respectively. 

Generally speaking, the non uniqueness of Frechet medians is due to some 
symmetric properties of data points. As a result, generic data points should 
have a unique Frechet median. In mathematical language, this means that 
the set of all the particular positions of data points is of measure zero. Now 
our aim is to find all these particular cases. Firstly, in view of the uniqueness 
result of Riemannian medians (see [19|, Theorem 3.1]), the first null set that 
should be eliminated is 

C\ = Iax\, . . . ,xn) G M N : x\, . . . , xn are contained in a single geodesic 1. 

Observe that C\ is a closed subset of M N . The second null set coming into 
our sight is the following one: 

C2 = I (xi , ■ ■ ■ , xjs;) G M N : (x\ , . . . , xn) is a critical value of ipo,L 1 • 

Since <p is smooth, Sard's theorem implies that C2 is of measure zero. More- 
over, it is easily seen that (M N \ C\) H C% is closed in M N \ C\. 

The following proposition says that apart form C\ U C2 one can only have 
a finite number of Frechet medians. 

Proposition 4.6. Q(x%, . . . , a?jv) is a finite set for every (xi, . . . , xn) G 
M N \(C 1 UC 2 ). 

Proof. Let (xi, . . . ,x N ) G M N \ (C\ U C 2 ) and A C Q{x\, . . . ,x N ) be the 
set of medians that do not coincide with any xt- If A = <j>, then there is 
nothing to prove. Now assume that A 7^ cj), then Proposition 14.31 implies 
that A C a o ip^ L {xx, ■ ■ ■ ,xn). Moreover, Lemma 14.51 and the constant 
rank level set theorem imply that ip^ L {x\, ■ ■ ■ , xn) is a zero dimensional 
regular submanifold of V x [0, L] , that is, some isolated points. Since 
(xi, . . . ,xn) 4- Ci) V 9 ol( Xl ' • • • -> x n) is also compact, hence it is a finite set. 
So that A is also finite, as desired. □ 



The following two lemmas enable us to avoid the problem of cut locus. 
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Lemma 4.7. Let U be a bounded open subset of V x such that ip : 
U i — > tp(U) is a diffeomorphism, then 

\f* f f(x 1 ,...,x N ) G tp(U) :aoip- 1 ( Xl ,...,x N ) G |J Cut(x A )| =0. 
^ fc=l ^ 

Proof. Without loss of generality, we will show that Af^ {(xi, ■ ■ ■ , xn) G 
M N : a o ip~ l (xi, . . . ,xn) G Cut(xjv)} = 0. In fact, letting [x\, . . . , xjv) = 
<p(x, rix, . . . njy, ri, . . . , rjv), n = (n\, . . . njy) and det (Dip) < c on U for some 
c > 0, then the change of variable formula and Fubini's theorem yield that 

^M^ii^U ■ ■ ■ ,x N ) G <p(U) : o o tp~ 1 (xi, . . .,x N ) G Cut(xAr)} 

= \f I N {(xi, . . . ,x N ) G (p(U) : x N G Cut((T o 99 _1 (xi, . . . ,x N ))} 

1 {x N eCui(ao t p-i(x 1 ,...,x N m dx l ■ ■ ■ dxN 

<p(U) 

= / 1 {ex Pa; (r J vn J v)gCut(aO} det(Dcp)dx dn dr x . . . dnv 

< c / llcxp^frjvnjvjeCut^)}^ (in dri . . . dr^ 

= C / dxdndri... dr N -x / llexp^fr^nwleCutfa:)}^ 

JyxR^- 1 Jr 
= 0. 

The proof is complete. □ 

In order to tackle the cut locus, it is easily seen that the following null 
set should be eliminated: 

C3 = / (xx, ■ ■ ■ ,xn) G M n : Xi G {xj} U Cut(xj), for some % 7^ j X. 

Observe that C3 is also closed because the set {(x,y) G M 2 : x G Cut(y)} 
is closed. 

Lemma 4.8. For every (xx, ■ ■ ■ , xjv) G M \ (C\ U C2 U C3), i/zere exists 
5 > suc/i f/iat 

f " 

j (yi,...,2/Ar) G fl(zi,*)x---xS(xjv,<y) : Q(yi,.",yAr)n|J Cut(y fc ) + 

^ fc=l 

Proof. If Q(xi, . . . , xjv) C {xi, . . . , xtv}, then the assertion is trivial by The- 
orem 12.31 Now assume that Q(xx, ■ ■ ■ , xjv) \ {xx, ■ ■ ■ ,xn} / 4>, then the 
proof of Proposition 14.61 yields that ipQ L (xx, ... ,xn) is finite. Hence we can 
choose e, n > and O a relatively compact open subset of V such that e < 
min{injM/2,min{r fc : (n, ...,rjv) G C ^o\l( x i>- • • > x A r )^ = • ■ -j-^}}) 
B( Xi ,2e) n Cut(B(xj,2e)) = </> and p~l +r] (xx,...,x N ) C O x (e,L + rj) N . 
Then by the stack of records theorem (see (TTJ Exercise 7, Chapter 1 , Sec- 
tion 4]), there exists 5 G (0,e) such that U = B(xx,5) x ••• x B(xn,5) 
verifies <p~^ L+v (U) = Vx U • • • U V h , Vj PI Vj = (f> for « 7^ j and ^E,i+r? : 
Vi — )• is a diffeomorphism for every i. Lemma 14.71 yields that there ex- 
ists a null set A C U, such that for every (yx,- ■ ■ ,Vn) G U \ A and for 



A 



Ai 
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every (y,m, . . . ,n N ,n, . . . ,r N ) G <p~ji +r) (yi, ■ ■ ■ ,Vn) one always has y £ 
UfeLi Cut(yfc). Particularly, for m G Q(yi, • • • , 2/at) such that d(m, y^) > e for 
every fe, we have m ^ Ufc=i Cut(yjfc). Now let m G Q(yi, . . . , yjv) such that 
d(m,y ko ) < e for some k , then d(m,x ko ) < d(m,y ko ) + d(y ko , x ko ) < 2e. So 
that m ^ UfcLi Cut(yfc) since y^ G B(x k , 2s). This completes the proof. □ 

Now the cut locus can be eliminated without difficulty 

Proposition 4.9. The set 

C 4 = S.( Xl ,...,x N ) £M N : Q( Xl ,...,x N )n \J Cut(x fc ) ^4>\ 
^ k=i ' 



is of measure zero and is closed. 



Proof. It suffices to show that C4 is of measure zero. This is a direct con- 
sequence of Lemma 14.81 and the fact that M \ (Ci U C2 U C3) is second 
countable. □ 

Let x, y G M such that y ^ {x}uCut(x), we denote 7 xy : [0, d(x,y)] — > M 
the unique minimizing geodesic such that 7^(0) = x and r y xy (d(x,y)) = y. 
For every u G T X M and u G T y M, let J(v,u)(-) be the unique Jacobi field 
along 7^ with boundary condition J(u,v)(0) = u and J(u,v)(d(x,y)) = v. 

Lemma 4.10. Let x,y G M such that y ^ {x} U Cut(x). Then for every 
v G T y M, we have 

v ^e^0 = ^ no 
d(x,-J 

where v nor is the normal component of v with respect to A f xy {d{x 1 y)). 
Proof. By [2, p. 1517], 

ex Ps 1 (0 V^exp" 1 ^) exp" 1 yV v d(x, ■) 



d(x, ■) d(a?,y) d(x,y) 2 

V^exp-^-) /_ - exp" 1 x \ exp- 1 y 



V, 



d(x,y) \"' d(x,y) / d(x,y) 2 
=j(0 x ,v)(0)- j(0,,^ tan )(0) 
=.7(0*, ^(O), 

where t> tan is the tangent component of u with respect to j xy (d(x,y)). □ 

With this differential formula, another particular case can be eliminated 
now. 

Proposition 4.11. The set 

( ^ exp" 1 x k •) 

C B = \(x 1 ,...x N )€ M N \ (Ci U C 3 ) : V - k ° = 1 /or some k 
1 fc#fc Wo^fcJ J 

is 0/ measure zero and is closed in M N \ (C\ U C3). 
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Proof. Without loss of generality, let us show that 

C' 5 = l(x 1 ,...,x N )eM N \(C 1 UC 3 ) : h( Xl ,...,x N ) 



N-l _i 



is of measure zero, where 

h(x\, . . .,xn) 



fc= -J d{x N ,x k ) 

By the constant rank level set theorem, it suffices to show that grad/i is 
nowhere vanishing on M N \ (C\ U C3). To this end, let (xi, . . . , xn) G 
M N \ (C\ U C3) and u = J2k=i ex P-r^ x k/d(xN,Xk)- Since N > 3, without 
loss of generality, we can assume that u and exp~^ x\ are not parallel. Then 
for each v G T X1 M, by lemma 14,101 we have 



V v h(-,x 2 , ■ ■ -,x N ) 



2(J(0 XN ,v™)(0),u)=2(^(v),u), 



v d{x N ,xi)' 

where the linear map ip is defined by 

%b : T X1 M — > T XN M, v .— > j(0 Xff , v nor )(0), 

v nor is the normal component of v with respect to exp^ 1 xjy. Hence we have 
grad 2 . 1 h(-,X2,- • • ,xn) = ~P*( U ), where ip* is the adjoint of ip. Since the range 
space of ip is the orthogonal complement of exp~^ x\, one has necessarily 
tp*(u) 7^ 0, this completes the proof. □ 

The reason why the set C5 should be eliminated is given by the following 
simple lemma. 

Lemma 4.12. Let (x±,... ,xn), (x\, ■ ■ ■ , x l N ) G M N \ C3 for every i G N 
and (x\, . . . , x l N ) — > (xi, . . . , xn), when i — > 00. Assume that mi G 
Q(x\, ... , x l N ) \ {x\,..., x l N } and rrn — ► x ko , then 

-1 



1. 



Proof. It suffices to note that for i sufficiently large, Proposition 14.31 gives 



k=£ko 



1 „i 



ex Pm,i x k 



d(mi, X l k ) 



— 1 i 
eX Pm, x k 



d(mi,x l ko ] 



1. 



□ 



Then letting i — > 00 gives the result. 

As a corollary to Proposition 14.111 the following proposition tells us that 
for generic data points, there cannot exist two data points which are both 
Frechet medians. 

Proposition 4.13. The set 

C 6 = . . . ,x N ) G M N \ (d U C 3 U C s ) : 

i/iere exis£ i ^ j suc/i i/iat f^x-i) = fn(xj), where /i = . . . , xj\r) 
is 0/ measure zero and is closed in M N \ (C\ UC3U C5). 
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Proof. For every (x u . . . , x N ) G M N \ (Ci U C 3 U C 5 ), let / (xi, . . . , x N ) = 
/ M (acjv_i) and g(xi, . . . ,x/v) = f^{x N ), where /j = /i(xi . . . ,x N ). Without 
loss of generality, we will show that {(x\, . . . ,xjv) G M w \ (Ci UC3U C5) : 
f(xi, . . . , xn) = g(xi, ■ ■ ■ , xn)} is of measure zero. Always by the constant 
rank level set theorem, it suffices to show that grad / and gradg are nowhere 
identical on M N \ (C x U C 3 U C 5 ). In fact, 



grad /(xi, 



,x N j 



-exPxl x N -i 
d(x N ,x N -i) 



N-l 

^E 

k=l 



- exp Xj ^ x k 
d(x Nl x k ) 



&ad x g(x 1 ,. 



because Ck is eliminated, as desired. 



□ 



As needed in the following proofs, the restriction of cp on the set 

E = |(x, m, ■ ■ ■ , riN,ri, . . . , rjy) G V x H N : < r& < r(rife) for every fc| 

is denoted by let <p. Clearly, (p is smooth. 

The lemma below is a final preparation for the main result of this section. 

Lemma 4.14. Let U be an open subset of M N '\Ujfe_i Cfc- Assume that UiU 
U2 C p~ l (U) such that for i = 1,2, <£j = ^|^. : Ui — > C/ is a diffeomorphism 
and o-(Ui) n c^L^) = 4>. For simplicity, when (x\, . . . , xn) G [7, we write 
x = <7 o (^^ 1 (xi, . . . ,xn), y = o~o (p~ (xi, . . . ,xn) and /j, = /i(xi, . . . , x/v). 
T/ien £/ie following two sets are of measure zero: 



(xi, . . . , x/v) G fJ : /„(x) = fjy) > and 



(xi, . . . ,Xjv) G U : there exists such that /^(x) = f^{x ko ) >. 



Proof. We only show the first set is null, since the proof for the second 
one is similar. Let /i(xi, . . . , xjv) = f^(x), f 2 (x 1 , . . . , x N ) = f^(y) and 
w k G T Xk M. Then the first variational formula of arc length (see U p. 5]) 
yields that 

' ' /ltexp^^Wi), . . . exp x{t) (tw N )) 



dt 

A' 



t=0 



Hence 



=E 

k=l 

N 

-E 

/«=i 

N 

=E 

fc=i 
grad/i(xi 



~ a; 
d(x fc ,x) 



x(0), 



exp x x k 



d(x,x k ) 



d(x k ,x) 

— exp^,^ x 
d(x k ,x) 



,XN) 



-,w k 



-,w k 



N 



Ko),E 



exp a . x fc 
J d{x,x k ) 



exp Xl x 



exp 1 x 



d(xi,x) ' ' d(xN,x) 
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Observe that (xi, . . . , xn) £ C±, N > 3 and x ^ y, we have grad f\ 7^ grad /2 
on U. Then the constant rank level set theorem yields that {/1 = f 2 } is a 
regular submanifold of U of codimension 1, hence it is of measure zero. The 
proof is complete. □ 

The following theorem is the main result of this section. 

Theorem 4.15. fj>(xi, . . . ,xjv) has a unique Frechet median for almost ev- 
ery (x±, . . . ,xn) S M . 

Proof. Since M N \ Ufc=i Cfc is second countable, it suffices to show that 
for every (x\, . . . , xn) G M n \ {J k=i Ck, there exists 5 > such that 
. . . , yjv) has a unique Frechet median for almost every (yi, . . . , y^r) £ 
501,(5) x ••• x B(x N ,6) . In fact, let (xi, . . . ,x N ) £ M N \[f k=1 C k , 
without loss of generality, we can assume that Q{x\, . . . , xjv) = {y, z,xn}, 
where y, z ^ {x\, . . . ,xn}- Assume that Y = (y, m, . . . , n^, ri, . . . , rAr) and 
Z = (2;, Ui,... ,vw,ti,... ,tjv) G • • • , a^Af)- Since (xi, . . . ,xjv) is a 

regular value of 0, we can choose & 5 > such that there exist neighborhoods 
U\ of y and U2 of Z such that for i = 1, 2, = <p\jji '■ Ui — > U is diffeomor- 
phism, a(Ui) n cr(J7 2 ) = and B(xiv,(5) n (o-(f/i) U o-(U 2 )) = (f), where U = 
B(x\,5) x ••• x B(xn,5). Furthermore, by Theorem 12.31 and Lemma 14.121 
we can also assume that for every (yi, . . . , yjy) G B(x\, S) X • • • X B(xn, <5), 
Q(y!,...,y N ) C B(x N ,6) U a(Ut) U a(U 2 ) and Q(yi, . . . , yjy) n S(xjv, 5) C 
{yAr}. Now it suffices to use Lemma 14.141 to complete the proof. □ 

Remark 4.16. In probability language, Theorem 14.151 is equivalent to say 
that if (X±, . . . ,Xn) is an M^-valued random variable with density, then 
n(X\, . . . , Xn) has a unique Frechet median almost surely. Clearly, the same 
statement is also true if Xi, . . . , Xjy are independent and M- valued random 
variables with desity. 
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